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' A LINGUISTIC  APPROACH  TO  AUiOMATIC  THEOKEM  PROVING 

Sharon  Sickel 
Information  Sciences 

University  of  California,  Santa  Cruz,  Ca. 

ABSTRACT  ' 

Generalizing  the  concept  of  a path  In  Clause  Interconnectivity  Graphs, 
we  define  the  set  of  simple  (I.e.,  cycle-free)  paths  that  begin  at  a specified 
subset  of  nodes.  Where  the  search  of  the  CIG  for  a proof  In  the  predicate 
calculus  was  previously  defined  In  terms  of  the  edges  of  the  CIG,  here  the 
simple  paths  themselves  become  the  atomic  elements  of  the  search,  thereby 
Increasing  the  "chunk"  size  of  the  operands.  We  can  further  define  forms 
similar  to  regular  expressions  In  which  the  terminal  symbols  represent  those 
simple  chunks.  The  forms  become  templates  that  model  proofs,  I.e.,  they  can 
be  mapped  onto  resolution  proofs  of  the  unsatlsf lability  of  the  clauses  making 

up  the  CIG.  In  general  a template,  represents  an  Infinite  number  of  paths  but  ^ 

1 

an  algebraic  computation  on  Information  derived  from  the  t*'!,..  ’•es  yields  valid 
proofs  without  an  exhaustive  search  through  Intermediate  - ..  of  the  search 
tree.  Overall,  the  method  leads  to  a reduction  In  both  the  computation  time 
per  step  as  well  as  In  the  combinatorics  of  the  search  Itself.  The  representa- 

i 

tion  also  lends  Itself  to  an  heuristic  based  on  Integer  programming  by  using  a 
simple  difference  function  based  on  the  chunks. 

Introduction 

A system  for  formal  theorem  proving  Is  presented,  using  the  Clause 
j Interconnect! vtty  Graph  as  Its  basic  data  structure.  Proofs  found  here  can  be 

mapped  onto  proofs  using  resolution  and  factoring  as  rules  of  inference  (as 
opposed  to  Modus  Ponens,  for  example).  The  search  method  bears  little  resemblance 
to  that  of  resolution  methods,  however. 

The  Clause  Interconnectivity  Graph  (CIG)  [S]  has  been  used  as  a representa- 

Itlon  for  proving  first-order  predicate  calculus  theorems.  A CIG  Is  a four-tuple: 

< Nodes,  Edges,  Subst,  Clause  > where 

_ 1 
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Nodes  Is  a set  of  graph  nodes*  one  for  each  literal  of  each  clause* 

Edges  Is  a synmetrlc  relation  bettifeen  nodes  such  that  <a*b>  f 
Edges  Iff  the  literals  associated  with  nodes  a and  b have 
opposite  signs  and  uniflable  atoms.  i 

..Suhst  Is  a mapping:  Edges  substitutions  such  that 

Subst(<a*b»)  Is  a most  general  unifier  of  the  atoms  of  the 
literals  associated  with  nodes  a and  b*  and 
Clause  Is  a mapping:  Nodes  •♦'i^(Nodes)  where  ^means  powerset; 

Clause  partitions  the  nodes  so  that  literals  In  the  same 
clause  have  corresponding  nodes  In  the  same  partition, 
for  example*  suppose  that  we  are  dealing  with  Integers  defined  by  Peano's 
axioms*  and  we  define  the  predicate*  Even: 

Even(0] 

Even(s"(0)) 

feven(s"(0))  -*•  Even(s”‘^^(0)) 

and  theorem  Even(s®°(0)).  Then  the  CIG  Is  shown  In  Figure  1.  

r aOC'^SSION  tor 


Even(s  (0)) 


Rm(s"(0)) 


Even(O) 


figure  1.  A Clause  Interconnectivity  Graph  with  labeled  edges.  The 

■ ■ A A A MM  A A ^ J ^ I ^ ..  ..  ft. 


fre^cates  and  terms  are  left  In  the  nodes  for  expository  purposes  only, 
hey  are  neither  Included  In  the  CIG  definition  nor  are  they  used  In  the 
search  for  a proof. 

t means  "successor";  s(0)  ■ 0;  s"(0)  ■ s(s"'Uo))  for  n > 0. 


o □ 
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Edges  Is  a symmetric  relation,  However»  when  we  Involve  an  edge  In  the 
search,  the  analogy  Is  made  to  moving  from  one  element  of  an  ordered  pair  In 
Edges  to  the  other  element  In  that  pair.  Therefore  when  an  edge  Is  used,  we 
. think  of  It  as  being  directed.  Given  an  edge  <a,b>  and  assuming  direction  a b, 

we  can  make  the  following  definitions. 

Deletingjiteral  Is  a mapping:  Edges  •«>  Nodes  where 
Delet1ng_J1teral(<a,b>)  ■ b and 

Residualjiterals  Is  a mapping:  Nodes  6^(Nodes]  where  ^ means  powerset. 

Resldualjlterals(b)  ■ Clause(b)  - (bl. 

A proof  derived  from  a CIG  corresponds  to  a particular  kind  of  search  on 

the  CIG.  The  proof  search  resembles  the  following  process: 

Choose  a clause  to  be  the  starting  clause  (a  clause  that  Is  likely 
to  be  used  In  the  proof,  a member  of  the  set  of  support,  etc.).  Place 
a marker  on  each  of  the  nodes  In  the  partition  representing  the  starting 
clause.  Each  of  those  markers  may  be  moved  along  any  edge  connected  to 
Its  present  position.  Then  the  parent  marker  Is  removed  (from  the  deleting 
node)  and  children  markers  are  placed  on  each  of  the  other  nodes  (the 
residual  nodes)  in  the  partition  arrived  at  from  the  move.  Then  the 
process  Is  repeated  on  ajl_  of  the  existing  markers;  they  1n  turn 
become  parents,  being  replaced  by  children.  The  goal  Is  to  eliminate 
all  markers, This  process  corresponds  to  unrolling  the  graph  Into  trees. 

From  looking  at  the  CIG  In  Figure  1,  It  Is  easy  to  see  that  some  move 
sequences  could  be  done  an  arbitrary  number  of  times,  e.g.,  moves  D,F,D,F,... 
successively,  or  E,C,E,C,...  Ne  call  such  sequences  loops. 

Assuming  starting  clause  Even(O),  the  first  move  Is  determined,  namely  6. 
That  leaves  a marker  on  the  node  corresponding  to  Even(s”''’^(0)).  From  this  node 
we  could  begin  one  of  the  loops  mentioned  above.  Let  us  consider  a sequence 
of  moves  involving  one  of  the  loops;  G(DF)  DA,  meaning  move  along  G,  then 
around  D and  F k times,  then  along  0,  then  A.  Intuitively  G links  up  the  integer 

t This  process  Is  over-simplified.  There  are  restrictions  concerning  the 
substitutions,  and  there  Is  another  allowable  move  that  admits  non-input  steps. 
For  a complete  description,  see  [S]. 
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0 with  the  start  of  an  Induction.  The  DF  loop  adds  the  value  2 to  the  current 

value.  Move  A Jumps  out  of  the  induction  to  the  value  that  we  seek.  In  other 
k 

words,  the  G(DF)  part  Is  successively  proving  that  0 is  even,  2 Is  even,  4 Is 
even,  etc.,  until  we  arrive  Just  short  of  the  given  value.  The  D and  A steps 
together  add  2 to  the  value.  In  this  case,  k will  have  the  value  29. 

Once  we  have  dlscovere^'GCOF)  DA,  proofs  of  the  evenness  of  all  even, 
positive  Integers  should  be  equally  easy  In  all  systems.  But  we  know  that 
they  are  not.  Using  traditional  deductive  systems  on  this  axiomatizatlon,  the 
length  of  the  proof  of  Even(s'^(0)}  Increases  linearly  with  n,  and  required 
resources  generally  Increase  exponentially  with  the  length  of  the  proof.  In 
this  method,  however,  the  discovery  of  the  proof  Is  of  the  same  Inherent 
difficulty  regardless  of  the  magnitude  of  n.  The  approach  Involves: 

1)  mapping  the  CIG  onto  a context>free  grammar  [1] 

2)  mapping  the  context-free  grammar  onto  a set  of  expressions  similar 
to  regular  expressions. 

3)  mapping  each  regular  expression  onto  a composition  of  substitutions. 

4)  checking  to  see  If  any  of  the  expressions  represent  a legal  substitution. 
If  so,  that  expression  can  be  mapped  onto  a proof. 

Chunking 

The  previously  presented  sllarch  schemes  on  CIG's  dealt  with  looping  by 
preferring  non-loop  moves,  preventing  run-away  development  of  Infinite  loops. 
However,  even  In  srnne  simple  cases,  we  may  need  to  travel  a loop  many  times. 

One  example  of  this  Is  the  proof  of  evenness  In  which  we  should  be  able  to  prove 
Even(6000)  easily  once  the  general  method  Is  discovered.  The  proof  Itself  may 
be  long,  but  the  search  time  should  be  Identical  to  the  search  time  In  proving 
Even(60)  or  Even(6).  In  fact.  It  Is  possible  to  use  this  method  not  only  to  . 
prove  individual  theorems,  but  also  to  derive  generalized  algorithms  to  do 
computations  within  a theory. 

Once  we  know  the  basic  steps  needed  for  a proof,  the  repetition  of  one  or 
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more  of  those  steps  a large  number  of  times  should  not  cause  us  any  trouble. 

He  need  to  discover  these  basic  steps  or  chunks.  One  might  Imagine  that  the 
moves  that  correspond  to  edges  might  serve  satisfactorily  as  chunks.  However» 
there  Is  some  obvious  clumping  that  takes  place.  The  CIG  In  Figure  2 has  three 
natural  chunks,  C<|:f,  C2:deg,  C2:abc,  because  the  moves  within  each  chunk  must 
be  taken  together.  Note  that  denotes  a loop,  and  we  can  travel  In  either 
direction  on  a loop,  so  we  can  denote  cba  as  In  this  case,  the  chunking 

partitioned  the  edges,  but  that  will  not  necessarily  be  the  case. 


Figure  2.  A CIG  divided  Into  Its  three  natural  chunks. 

We  can  derive  the  chunks  by  finding  all  ways  of  moving  and  replacing  the 
markers  such  that  If  a marker  Is  on  the  same  node  as  one  of  Its  ancestors,  we 
freeze  that  marker,  but  continue  to  move  other  available  markers.  The  starting 
configuration  for  each  chunk  Is  a single  marker  sitting  on  some  node.  The 
chunk  Is  said  to  be  related  to  that  node.  Intuitively,  the  chunk  represents 
the  refutation  of  the  literal  that  the  related  node  represents.  This  process 
Identifies  all  of  the  natural  pieces  of  the  graph.  Since  no  repeated  looping 
Is  allowed,  this  Is  a terminating  process. 

We  classify  the  chunks  Into  two  types,  terminal  and  loop.  A terminal 
chunk  Is  one  In  which  all  markers  have  been  eliminated.  A loop  chunk  has  one 
or  more  frozen  markers.  In  Figure  2,  C.  and  C,  are  terminal  chunks;  C«  Is  a 
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loop  chunk. 

Chunks  to  Context-Free  Grawmar 

Chunks  as  described  In  the  previous  section  are  trees,  since  !)  a parent 
marker  may  be  replaced  by  one  or  more  children  markers  and  2)  no  marker  can 
ever  be  Its  own  descendant.  He  wish  to  write  the  chunks  as  linear  sequences 
so  that  we  can  use  them  In  constructing  a grammar.  We  produce  this  flattening 
by  doing  an  end-order  traversal  [4]  of  the  "chunk  tree".  The  flattened  form 
Is  a sequence  of  directed  edges  and  nodes,  s^,  S2t  ...  s^.  We  can  make  context- 
free  productions  by  putting  s.|,  S2»  ...  s^^  on  the  right-hand-side  and  the 
associated  node  on  the  left-hand-side, 

N s<|S2...s^. 

The  intuitive  notion  Is  that  to  eliminate  N you  must  add  s^,S2i...s^  (possibly 
Including  N).  We  can  now  construct  a context-free  grammar  G: 
nonterminals:  (S)  U Nodes  (where  {S}  (1  Nodes  > <t>) 
terminals:  Edges 

productions:  (all  N -*■  SiS2...s^  as  described  above} 

U' {S -»•  N^...N|^  1 N^,.,,N|^  represent  all  literals  In 


starting  clause) 


start  symbol:  S 


In  the  ground  case  any  string  In  the  language  of  G,1.e.  any  string  that 
is  derivable  from  S and  consists  entirely  of  terminals  (In  this  case  edges), 
represents  a proof.  Therefore,  once  the  chunking  is  accomplished,  determining 
theoremhood  of  the  statement  In  question  Is  equivalent  to  asking  whether  a 
given  context-free  grammar  generates  a non-empty  language,  which  Is  a trivial 
problem. 

The  general  case  Is  more  difficult,  however.  Each  edge  has  an  associated 
substitution,  and  for  a string  of  edges  to  be  acceptable,  all  of  their 
substitutions  must  be  mutually  consistent.  Consl£ten^(o.|,a2....a^)  Iff 
0 (o2  ® (.••  ® Op)))  Is  defined,  where  a o 0 ■ y such  that  y Is  a most 
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I general  substitution  satisfying  (Lo)y  « (Ly)oi  ■ Ly  ■ (Le)y  “ (Ly)3  for  an 

arbitrary  literal  L [5].  Since  all  terminal  strings  must  abide  by  consistency, 
this  Is  In  fact  a context-free  attribute  grammar  [3]  and  can  have  the  power  of 
I * a type  0 gramnar.  This  fact  eliminates  the  usefulness  of  the  result  that  tells 

! us  there  Is  an  upper  bound  on  the  length  of  the  shortest  string  in  the  language. 

However,  the  grammar  form  provides  us  with  some  valuable  heuristics  as  we  shall 
see  later. 

I Regular-1 Ike  Expressions 

I Given  a context-free  gramnar.  It  would  be  convenient  to  represent  the 

I ■ language  generated  In  regular  expression  style.  To  do  that,  we  need  to  extend 

I the  definition  of  regular'- expression.  In  addition  to  "1",  meaning  "or”, 

j concatenation  meaning  "and",  and  meaning  "repeat  zero  or  more  times",  we 

! * 

; add  exponent  " iean  repeat  exactly  n times.  For  the  grammar  constructed 

In  the  previ  -lun,  If  all  productions  that  have  node  N on  the  left-hand- 

, side  have  one  of  t^,...t^  (terminal  chunks),  or  r^N,..,r|^N  (loop  chunks),  then. 

Intuitively,  the  expression  (r^  ^2! ..  .r|^)*t^  | .. . |t^  represents  the  refutation 

I for  N and  we  denote  it 

* * * 

i . . h ■>  (r^ |**2l . . . jrj^)  (ti|...|t^). 

I.e.  we  can  go  around  loops  as  long  and  in  whatever  order  we  choose,  but  we 

1 

must  finally  end  with  a terminal 
In  the  example  In  Figure  2, 

0 ■>  (abc)*deg,  ©■>  (cba)*f. 

It  may  be  that  by  the  above  recursion  method  and  by  simple  back-substitution 
for  nonterminals  of  right-hand-sides  having  the  corresponding  nonterminals  on 
the  left,  we  can  derive  S ■>  PiP2***Pn  P^  ^ Edges,  For  the  example  of 
Figure  2,  the  grammar  is; 


t This  notation  appears  frequently  In  the  literature  on  formal  languages, 
a A ■>  B means  B can  be  derived  from  A by  an  application  of  zero  or  more 
productions. 
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,c»d,e»f,g}«  P,  S)  where  P: 

s-^  ©d) 

©•*•  a b c (T) 

0-*-  deg  ' 

c b a ® 

©■^  f 

AW  A 

By  back-substitution' we  get:  S ■>  (abc)  deg(cba)  f.  Now  by  replacing  each 
terminal  by  Its  substitution  and  Interpreting  concatenation  of  substitutions 
to  mean  ® , we  can  easily  determine  whether  there  exist  non-negative  Integers 
n and  m such  that  subst'^Cabc)  © subst(def)  0 subst'”(cba)  0 subst(f)  Is 
defined.  Note  that  wp  have  replaced  whole  chunks  by  their  substitutions.  The 
substitution  of  a chunk  Is  the  0 composition  of  the  substitutions  of  the  edges 
making  up  the  chunk.  Each  time  a loop  Is  repeated  a new  Instance  of  the  clause 
at  the  endpoints  of  the  loop  Is  added.  For  this  reason*  a loop  repeated  n 
times  will  have  n distinct  Instances  of  the  variables.  Loop  substitutions, 
then,  must  be  abstract  descriptions  Including  an  unknown  number  of  instances 
of  variables.  For  example  the  substitution  [f(x^)/Xj^^^]  specifies  that  each 
new  Instance  of  x Is  replaced  by  function  ''f  applied  to  the  term  substituted 
for  the  last  Instance  of  x. 


For  example,  the  grammar  built  from  the  CIG  In  Figure  1 having  Even(O) 

as  the  start  clause  would  cause  S to  generate  (among  others)  the  expression 

* 

G(DF)  DA.  The  corresponding  substitution  e Is 
[0/n]  © [n+l/m,  , m+l/n]*  © [n+l/m]  e [59/m], 


"1  “ ”1  ^ 1^ 

"1.1  • "1 
(isl)  J 


m^  ■ 21+1 
n^  • 21 
(isl) 


S The  other  nonterminal  names  and  their  productions  are  Irrelevant  to  this 
discussion. 
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Differentiating  between  Instances  of  variables,  e becomes  [O/Oq]  O [21/n^» 
21-l/m^_l3  ® [nj^+l/mj^]  © [59/mj^]  where  1 < 1 < k.  mj^  » 59  - = 2k+l, 

therefore  k » 29,  Indicating  that  the  refutation  consists  of  G,  twenty- 
' nine  repetitions  of  (CF)  and  finally  D and  A.  We  will  not  go  Into  how  to 
generally  describe  loop  substitutions,  decide  which  instances  of  a variable 
are  referred  to  by  other  substitutions,  or  compute  the  exponent  of  loops. 

However,  for  a given  expression  that  Is  a regular  expression  extended  by 
exponents  and  contains  no  node  names  (I.e.,  Is  completely  terminal).  It  is 
straightforward  to  answer  those  questions.  Due  to  lack  of  space  the  algo- 
rithms will  be  presented  In  a subsequent  paper. 

Integer  Progratrminq  Heuristic 

There  will  be  grammars  derivable  from  CIG's  that  do  not  easily  admit  the 
extended  regular  expressions.  They  Include  1)  grammars  in  which  the  self- 
referencing  non-terminal  appears  In  the  middle  of  the  right-hand-side  (e.g., 

N aNb)  and  2)  grammars  In  which  a nonterminal  can  generate  a string  con- 
taining two  copies  of  Itself,  e.g.,  N *>  aNNe  where  a and  e are  possibly 
empty  strings  of  symbols,  I.e,,  a,e  € (Edges  U Nodes)*.  In  the  latter  case, 

. It  Is  difficult  to  see  the  general  recursion  pattern  since  the  length  of  the 
resulting  string  Is  exponential  with  the  number  of  repetitions.  In  both 
cases  keeping  track  of  which  Instances  of  the  variables  to  put  in  each  sub- 
stitution Is  a horrendous  Job  In  general. 

By  weakening  the  grammar,  allowed  by  Its  particular  use  in  this  application, 
and  not  by  distinguishing  between  different  instances  of  the  same  variable,  we 
can  always  derive  an  extended  regular  expression  reduced  to  terminals,  the 
terminals  possib'y  reordered  from  what  the  grammar  would  actually  generate. 

Every  chunk  has  a (possibly  empty)  effect  on  the  total  substitution  in  a 
solution.  Terminal  chunks  have  a fixed  effect.  Loop  pieces  may  have  a recur- 
sive effect.  E.g.,  has  the  effect  of  adding  f to  the  accumulated 

effect  and  applying  It  to  the  new  "x". 
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By  combining  the  Information  from  the  reordered  extended  regular 
expression  and  the  chunk  effects,  it  Is  possible  to  write  Integer  programming 
problems[2]  whose  solutions  are  likely  candidates  for  proofs.  In  this  way, 
the  effects  serve  as  difference  functions  for  the  chunks  (operators)  in  much 
the  way  as  is  done  in  an  operator  difference  table.  The  integer  program 
tells  us  how  many  applications  of  each  operator  there  are  in  likely  candi- 
dates. The  structure  of  the  original  grammar  can  then  be  used  to  check  the 
validity  of  that  candidate.  An  example  of  this  Is  the  "Even"  problem  in  which 
we  need  to  change  the  term  from  "0"  in  the  start  state  to  "s^°(0)"  in  the  goal 
state.  Therefore  the  sum  of  the  effects  of  the  chunks  used  must  sum  to  exactly 
sixty  applications  of  "s".  In  some  cases,  the  start  and  goal  states  are  not 
so  clearly  known  and  we  have  to  phrase  the  problem  slightly  di ffererif'y  such 
that  the  original  terms  used  In  the  solution  plus  the  effects  of  all  applied 
chunks  sum  to  zero. 

In  cases  where  the  regular  expression  forms  are  exactly  known,  the  Integer 
programning  heuristic  is  substantially  improved  because  the  proper  placement 
of  variable  instances  is  known.  We  may  then  break  the  problem  Into  subproblems  - 
one  for  each  variable. 

Work  on  the  integer  programming  heuristic  and  computation  of  effects  of 
more  complex  loops  is  currently  in  progress. 
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